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Abstract — We introduce Schroedinger Eigenmaps, a new 
semi-supervised manifold learning and recovery technique. 
This method is based on an implementation of graph 
Schroedinger operators with appropriately constructed bar- 
rier potentials as carriers of labeled information. We use our 
approach for the analysis of standard bio-medical datasets 
and new multispectral retinal images. 

Index Terms — Schroedinger Eigenmaps, Laplacian Eigen- 
maps, Schroedinger operator on a graph, barrier potential, 
dimension reduction, manifold learning. 



I. Introduction 

Typical modern bio-medical data are characterized by 
high complexity (usually by means of high dimensionality), 
and the underlying nonlinear processes are often unknown, 
forming a fundamental barrier to better understand human 
physiology. Moreover, as personalized medicine expands, 
increasingly detailed patient data need to be integrated 
in clinical trials and individual treatment decisions, which 
usually results in a complicated classification problem. 

Data driven analysis schemes are commonly based on 
kernels that derive the proper classification directly from the 
data. However, bio-medical applications are also character- 
ized by low signal to noise ratio so that effective schemes 
require additional expert input. This outside information 
is, depending on the perspective, called labeling, (semi-) 
supervision, regularization, or learning from training data. 
Often, because of the nature of medical experiments, there 
are only few training points available and analysis schemes 
are needed that learn from these few data points most 
efficiently, while being robust against perturbations. 

Examples of kernel based analysis techniques are Locally 
Linear Embedding (LLE) ll37l . Hessian-based LLE ll23ll . 
Laplacian Eigenmaps llH, Diffusion Wavelets ifTTl . ifTSl . or 
Diffusion Maps |[T6ll . and kernel PCA and Support Vector 
Machines 1391 . These methods represent the data in form 
of a graph, with nodes that are formed by the data vectors, 
and with edges that represent the distances between pairs 
of such vectors. This information is stored in the adjacency 
matrix, which then is modified to form the kernel. 
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Physical or experimental constraints often suggest that 
the data points belong to a low-dimensional manifold. Many 
kernel-based methods recover this manifold by means of 
representations in terms of the most significant eigenvectors 
of the kernel matrix \A2\ . A major feature of Laplacian 
Eigenmaps and related methods is the preservation of local 
spectral distances, while reducing the overall dimension of 
the data represented by the aforementioned graph, cf., [|T5l . 
It has been successfully applied to pattern recognition and 
segmentation problems EOl, EH and EI, (HI. 

Laplacian Eigenmaps and similar methods originally lead 
to fully automated classification algorithms but have also 
been modified to work in semi- supervised settings: Belkin 
and Niyogi ||3l, for instance, proposed to use classi- 
fiers induced from partially labelled data on the Laplacian 
representation of unlabeled points. Belkin et al. fT\ used 
regularization in reproducing kernel Hilbert spaces to obtain 
a family of semi- supervised and transductive learning algo- 
rithms related to Laplacian Eigenmaps. In [|35l . extensions 
of the regularization methods proposed in Q are obtained 
via inverse regression. In related developments, Zhu et 
al. B9l proposed to augment the data graph with the 
results of a prior classifier. Local discriminant analysis to 
locally modify ("warp") the diffusion metric is proposed 
by Coifman et al. in ifTTl . Szlam et al. BTIl developed a 
further generalization by means of collecting all the class 
information into a modification of the metric used to build 
the kernel. 

In the present paper we develop a new Schroedinger 
Eigenmaps algorithm that builds upon Laplacian Eigen- 
maps and steers the diffusion process within the uni- 
form theory of Schroedinger operators in place of the 
Laplacian. Experts (e.g., trained practitioners) can intro- 
duce their input in form of a barrier potential for the 
associated Schroedinger operator on a graph to improve 
the detection and classification processes by steering the 
Schroedinger diffusion process effectively. At the same 
time, the new graph Schroedinger operator converges to- 
wards the Schroedinger operator on the manifold (just 
like the Laplacian kernel converges towards the Laplace- 
Beltrami operator), see Section IIII-BI 

Such defined barrier Schroedinger potentials are easy to 
implement and appear in our numerical experiments to be 
well-suited for the analysis of low quality data with only 
few training points available - a typical situation in bio- 
medical applications. After having validated the usefulness 
of Schroedinger Eigenmaps (SE) by comparison to Support 
Vector Machines (SVM) on three standard medical datasets. 
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we apply our method to analyze multispectral retinal im- 
ages and genetic expression profiles. The proposed scheme 
provides two major features for data segmentation. On one 
hand, by an appropriate selection of the potential, it allows 
the practitioner to separate data vectors, which should 
not appear in the same class. Specifically, in bio-medical 
imaging, the practitioner could be a physician who has addi- 
tional information from results of other imaging devices or 
from physiological processes and knowledge about disease 
progression. On the other hand, Schroedinger Eigenmaps 
allow to identify complex regions by means of decreasing 
the relative distances between labeled representatives. For 
the physician who has already identified several different 
regions of pathology, this technique allows to collect these 
regions into one pathology class and therefore allows to 
better indentify normal physiology. 

In Section nil we review Laplacian Eigenmaps. We intro- 
duce the new Schroedinger Eigenmaps with an investigation 
of its properties in Section [IIIl where we also visualize the 
behavior on artificial data. Section IIII-EI is dedicated to the 
description of the intertwining of Schroedinger Eigenmaps 
and classification schemes. In Section [iVl we compare 
Schroedinger Eigenmaps with Support Vector Machines 
on 3 standard medical datasets. We also analyze new 
multispectral retinal images. 

II. Laplacian Eigenmaps 

Given m points C M^, we assume that 

they belong to an n-dimensional manifold, where n is much 
smaller than N. The goal is to recover this manifold or, 
in other words, to find a low-dimensional representation 
{^1, . . . , ^m} C of the original dataset {xi, . . . , Xm} C 
R^. Here, we briefly recall the three-step procedure of 
Laplacian Eigenmaps, see ||4l: 

Step 1. Adjacency graph construction: 
Build a graph Q, whose nodes i and j are connected if 
Xi is among the /c-nearest neighbors of Xj or vice versa. 
The distances between data points are measured in the 
Euclidean metric. The graph Q represents the connectivity 
of the data, and /c is a parameter. 

Step 2. Heat kernel as weights: 
Weight the edges of the graph. A typical example is the 
diffusion weight matrix W given by 



0, 



i and j are connected, 
otherwise. 



Step 3. Solving an eigenvalue problem: 

Let D he 3. diagonal matrix with Di^i = Wij. Denote 

the low-dimensional representation by an m x n matrix 

y = . . . , ?/^)^, where each row yi is considered as 

a vector in R^. Next, consider the following minimization 

problem 



mm 

y'^Dy= 



id 



min trace(?/ Ly)^ 

y-^Dy^I 



(1) 

where L = I) — 1^ is the m x m Laplace operator and 
/ is the identity matrix. The minimizer of O is given 



by the n minimal eigenvalue solutions of Lx = XDx 
under the constraint y^ Dy = /, i.e., the columns of 
the minimizer y^^'^ are the n eigenvectors with respect to 
the smallest eigenvalues. If the graph is connected, then 
1 = (1, . . . , 1)^ is the only eigenvector with eigenvalue 
0. Instead of ([B, we solve for min trace(^^L^) subject to 
y^Dy = /, y^D^/'^l = 0. By applying z = D^^'^y, this 
yields min trace (z^ £2;) subject to z'^ z = /, z^l = 0, 
where C = D~^^'^LD~^^'^. The minimizer z*^^^ is given 
by the n eigenvectors with smallest nonzero eigenvalue, 
and we obtain the sought n-dimensional representation 
{^1,...,^^} from ^ = 1^-1/2^(0). 

Though algorithms based on spectral decomposition, 
such as Laplacian Eigenmaps, provide a powerful tool for 
non-linear dimensionality reduction and manifold learning, 
they are not without certain shortcomings. We refer the 
interested reader to, e.g., ||28l, lESJ, ||43]. 

III. Schroedinger Eigenmaps 

The idea of using solutions of the Laplace equation for 
labeling is not new. In [SO], for instance, the concept is 
utilized for image segmentation, where a small number 
of pixels is labeled, and random walk theory is then 
applied to classify unlabeled surrounding pixels. We aim 
to develop a more general classification framework by 
means of Schroedinger operators. This approach allows 
for more flexible labeling schemes. Moreover, a significant 
advantage of the Schroedinger Eigenmaps technique is 
that it applies not just to surrounding pixels, but to any 
image location allowing for classes whose members are not 
spatially connected. This more general approach enables 
its application to data that do not encode location as, for 
instance, gene expressions. 

The motivation behind this approach is two-fold. Math- 
ematically, we are interested in shifting the analysis of 
complex, high-dimensional phenomena towards studying 
the associated operators acting on these data-dependent 
graphs, and away from analyzing pointwise relationships 
between the graph nodes. Physically, partially labeled sets 
naturally lead to the notion of barrier potentials and to the 
way in which these potentials affect the diffusion processes 
on graphs. As such, by appropriately choosing locations of 
the potential barriers we can steer the diffusion process to 
allow for identification of the correct cluster containing the 
labels, when compared to unobstructed diffusion process of 
the Laplacian Eigenmaps algorithm. 

We would like to point out that [1] and ifTTIl deal 
with Schroedinger equations with Hamiltonians, which are 
Laplacians without any potential. We, on the other hand, 
deal with Schroedinger operators, which are Laplacians 
with barrier potentials; as such, these are two different 
generalizations of the Laplace operator approach. 

A. From Laplace to Schroedinger 

The matrices L and C in the previous section can be 
associated with the notion of the Laplace operator A, 
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cf. El. The related diffusion equation dt^ = /S.(^ is 
extended to the Schroedinger Equation 

dt^l){x, t) = Atlj{x, t) + v{x)^l){x, t) (2) 

by adding a potential term v{x) to the Laplace operator. 
We revisit this idea in the following section to introduce a 
discrete Schroedinger operator. 

B. Schroedinger Eigenmaps 

The potential v in (|2]) is considered as a nonnegative 
multiplier operator. The discrete analogue of f = A + v is 
the matrix E = L , where F is a nonnegative diagonal 
m X m potential matrix. We replace ([TJ with 

min trace(?/^(L + aV)^), (3) 

y^Dy=I 

which is equivalent to 

min [trace(^^L^) + trace(^^aV^)]. 

y^Dy^I 

The parameter a > is added here so that it can be used to 
emphasize the relative significance of the potential V with 
respect to the graph Laplace operator. 

According to O, the minimization problem ^ is equiv- 
alent to 

min lY.\\y,-yjfw.^j+aY,Vmyi\\', (4) 

y^ Dy=I ^ "—^ 

where V = diag(F(l), . . . , F(m)). The first component of 
the above sum incurs a penalty, when neighboring points xi 
and Xj are mapped into points yi and , respectively, which 
are far apart. The second component penalizes these points 
z = 1, . . . , m, which are associated with large values of 
V{i). Alternatively speaking, if V took only two values, 
and 1, then the minimization (|4]) yields a dimension- 
reduced representation y, which forces increased clustering 
of the representations yi of points associated with the value 
Vii) = 1, while attempting to ensure that close points 
remain close after the dimension reduction. As such we may 
utilize the potential V to label points which we want to be 
identified together after the dimension reduction. Because 
of the built-in preservation of topology of the point cloud 
(induced by the Laplacian), this labeling may be used to 
segment a particular class of points. We shall make this 
intuition clear in the following sections, see, e.g.. Corollary 
lin.2l and Corollary lin.4[ or the applications in Section [IVl 
It is known that the (rescaled) graph Laplacian converges 
towards the Laplace-Beltrami operator on the underlying 
manifold, e.g., (H, ll44l . This convergence carries over 
to the Schroedinger operator as we shall see next. The 
Laplacian map is given by 

j j 
For the purpose of this analysis we choose k = m, i.e., we 
assume that all data points are connected. Replacing the Xi 
by an arbitrary x G yields the extension to R^: 

LUi^) = m J2 e — ^ - E — ^ • 

j j 



It turns out that there exists a positive constant C such that 
for i.i.d. uniformly sampled data points {xi, . . . , Xm} and 
am 4m"-+2+% where s > 0, and / G C^(7W), we have 
the convergence 

lim C^^^ = AMf{x) (5) 

m^oo m 

in probability, cf. (H. Let v be a given potential on the 
manifold M. The associated matrix V acting on a discrete 
m-point cloud is defined sls V = diag(v(xi), . . . , v{xm))- 
Since the potential does not depend on a, we may replace 
Xi by an arbitrary x G R^ to obtain the map: 

Vmf{x) = V{x)f{x). 

Clearly this extension coincides with the continuous poten- 
tial on the manifold. As such, adding the discrete potential 
Vm to the discrete Laplacian does not impede the conver- 
gence in (O. Consequently, the term 

^ (Tra^)-^ + Vmfix) (6) 

m 

converges for n ^ oo towards 

AMf{x)^v{x)f{x) 

in probability. For more results on convergence of the 
Schroedinger Eigenmaps algorithm to Schroedinger oper- 
ators on manifolds, we refer the interested reader to [|33l 
and El. 

We close this section by noting that expression (|6]) 
induces a specific choice of the parameter a. Indeed, in 
order to consider E = L -\- aV, rescaling of in © 
implies the need to reversely rescale V. This can be done 
by means of multiplication with a = 4 ^ ^ , o , which 

(Tram) 2- 

tends to infinity as m ^ oo. As such, optimal selection of 
a is related to the size of the dataset. 

C. General Properties 

In this section we study the properties of the 
Schroedinger Eigenmaps (SE). We shall see that the po- 
tential can be used to push vectors towards zero, which 
can be helpful in classification tasks. In fact, we shall see 
later that even nondiagonal "potentials" make sense and, 
thus, address a more general setting where E = L-\- aV is 
symmetric positive semi-definite. Note that E can still be 
diagonalized and all eigenvalues are nonnegative. If V 7^ 
is a nonnegative diagonal potential, then 1 null(]/). For 
a connected graph Q, the latter implies that E = L -\- aV 
has full rank, i.e., there is no eigenvalue that equals zero. 

To study the effect of the potential V on eigenvectors 

of the Schroedinger operator, we make use of the notation 

WyWj^ := tTSice{y^ My) for a matrix M and we denote 
V:=L>-i/2yL>-i/2. 

Theorem III.l. Let V be symmetric positive semi-definite, 
n < dim(null(y)), and let y^'^^ be a minimizer of ([3]). 
a) There is a constant Ci > such that 

lb("^fy<Ci-. (7) 
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b) If z^^^ is the columnwise orthogonal projection of 
^{a) ^ D^/'^yM onto null(V) ^ {0}, then there is 
a constant C2 > such that 



(8) 



Proof: First, we address part (a). Applying z = D^^'^y 
to ([3]) yields 



min trace(z {C-\-aV)z). 

z^z=l 

Since V is symmetric, (O can be rewritten as 



(9) 



min tTdiCe{z^ Cz -\- z\.±aVz^^ii±)^ 

^nuiiGnull(V), z^^,,enu\\iV)^ 

where Znuii is a matrix whose columns are contained in 
null(V). The constant 

Ci := ^ min trace(Znuii>C2;nuii) 

^null^null = ^ 

2;nuiiGnull(V) 

only depends on the nullspace of V, and it is associated 
with the choice z^^ii± = 0. Since y^^"^ is a minimizer of ©, 
= D^/'^yM is a minimizer of (O. Obviously, we have 
+ Wz^'^^Wly < Ci, and according to ||^^")||^ > 0, 
we obtain ||^^"^||^v < ^i- This finally yields Wz^'^^Wl < 
Ci^. Since ll^^^^ll^ = ll^^^^lly, we have proven ©. 
In order to verify dS), we will use 

^ null^null ~ ^ ^ null^^null^* 

Note that V is symmetric positive semi-definite and there- 
fore II • II V is a norm on null(V)^. Since all norms on a 
finite dimensional vector space are equivalent, there is a 
constant C > such that || • ||/ < C|| • ||v on null(V)^. By 
applying C2 -= C^Ci, we get the estimate 



^(«) ||2 



> n- 



>n 



C'\\z^-% 
a 



'I nulJ 



11?^ 



C'\\y 



Let us now state the consequences of Theorem IIII.ll for 
a diagonal potential V. 

Corollary III.2. Under the assumptions and notation of 
Theorem IIII.ll let V = diag(vi, . . . , Vm), then we have, 
for a// i = 1 , . . . , m, 



i=l 



1 

a 



Remark III.3. Let V = diag(l, 0, . . . , 0) , then y[''^ is 

pushed to zero according to Corollary IIIL2I If xi and X2 

were sufficiently close, then we expect that 7/2"^ is also 
pushed to zero due to the linkage induced by L. 

Corollary IIII.2I means that a diagonal potential pushes 
entries of to zero. On the other hand, the potential 
does not force any collapse: 



Corollary III.4. Under the assumptions and notation of 
Theorem IIII.ll let V = diag('Ui, . . . , Vm)- 

a) If vi^ . . . ^Vr ^ and Vr-\-i ^ . . . ^Vm =0, then 

m ^ 

E >n- max {^}C,-. 

^ — ' 2=1,. ..,r a 

i=r+l 

b) If Vi = Dii^ . . . = Dr^r (^^d I'r+l, • • • 7 =0. 

then 

m ^ 

i—r-\-l 

Proof: Part (b) is a special case of part (a). To prove 
part (a), we apply Corollary IIII.2I which yields 



E^Mbf^f = E 



4"^f = ii4:iJi? 



i=r+l 



i=r+l 



r 

> n — max { } v^, 



(ck)||2 



>n— max l^^^jCi— . 



Corrollary IIII.2I says that penalized locations are pushed 
to zero, and Corollary IIII.4I ensures that not all unlabeled 
data are collapsed into zero. Thus, labeling can lead to a 
separation between labeled and certain unlabeled data. 

D. Schroedinger Eigenmaps: Nondiagonal Potentials 

While a diagonal potential V is used to overweight or 
underweight certain nodes yi, it turns out that nondiagonal 
"potentials" are useful as well. For i ^ j, let F^*'-^) be 



the matrix with entries V, 



1 and V, 



-1 and zeros elsewhere. One then verifies that 



tTace{y'^ aVy) = a\\yi - yjf, 
and Theorem IIII.ll yields 



I (a) (a)||2 

Ivi -y) II 



<ai. 

a 



In other words, this kind of potential allows to identify 
points. Adding the matrix F^*'-^) to the Laplacian has the 
flavor of modifying the underlying graph Q by reweighting 
its edges. It allows for modifying the weights Wij, i j, 
and we are able to penalize distances between data points 
which are not neighbors. We note that a similar change 
in the graph structure results from warping or weighted 
diffusion as proposed e.g., in fTTl or BTI . 

Remark III.5. If the graph Q is connected, then E = 
L + aV^'^'^^ has a simple eigenvalue equals zero since 
1 G null(F*^*'-^^). Thus, we should use the constraint 
y'T]ji/2^ = for SE with potentials V^'^^\ To estabHsh a 
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(a) (left) Original arc in three-dimensional 
space, (right) Laplacian Eigenmaps per- 
fectly recovers the two-dimensional struc- 
ture. 




-1 1-1 1-1 1-1 



(b) The Schroedinger Eigenmaps with diagonal potential V = 
diag(0, . . . , 0, 1, 0, . . . , 0) only acting in one point in the middle 
of the arc for a = 0.05, 0.1, 0.5, 5. This point is pushed to zero. Since 
V is 'discontinuous', the dimension reduced arc gets more and more 
discontinuous until will become zero for sufficiently large a. Then 
the arc does not deform any more. 




(c) By labeling the end points of the arc within the nondiagonal potential 
for a = 0.01,0.05,0.1,1, we are able to control the dimension 
reduction such that we obtain an almost perfect circle (up to a small 
discontinuity). 



Fig. 1. Schroedinger Eigenmaps applied to an arc 

uniform methodology of Schroedinger Eigenmaps, we use 
the constraint D^^'^w = throughout, where w is an 
eigenvector of the Schroedinger operator E = L -\- aV, 
with respect to the smallest eigenvalue. Thus, as in the case 
of Laplacian Eigenmaps, we skip the smallest eigenvectors 
and take the n subsequent ones. 

To demonstrate the use of the barrier potential, SE is 
applied to a three-dimensional arc in Figure [T] 

Remark III.6. For the purpose of sequentially identifying 
points, we propose to use the potential 

m — l 

V=Y^ V(^'^+^\ (10) 

which penalizes L by Yl7=i^ WVi ~ Note that the 

eigenvectors of this matrix are the basis vectors of the 
discrete cosine transform of type II. 

E. Schroedinger Eigenmaps Eor Classification 

In this section we discuss the standard Vector Angle 
Classification (VAC) in combination with Schroedinger 
Eigenmaps. In this regard, suppose we have s pairwise 
different seed vectors ai,...,as G M^, which represent 
the centers of each class. The class Ci consists of ^-"^s 
whose angle is closest to the specific center vector as long 
as the angle is smaller than some tightness parameter Si . We 
additionally apply a threshold parameter S to refuse ^-"^s 
that have small norm, which by themself form another class. 



According to Corollary IIII.2I SE can be applied to 
"force" processed pixels to zero at the rate l/^a and, 
hence, below any threshold ^ > for sufficiently large 
a. A practitioner can use labels in the form of a diagonal 
potential to separate a particular region of an image, see 
Section IIV-BI On the other hand, different spatial regions 
can be identified with each other by labelling them with 
non-diagonal potentials F*^*'^), cf., (fTOl) . This is useful to 
classify pathological areas as one single class despite the 
fact that intrinsic intra-class variations can be large. 

As it was mentioned in the introduction, a very consid- 
erable effort has been aimed at developing partially labeled 
classification schemes based on the Laplacian Eigenmaps 
algorithm, on related graph random walks, or on Diffusion 
Maps. The classification scheme based on Schroedinger 
Eigenmaps with partially labeled data is novel. When 
compared to "warping" and function adapted diffusion 
processes ifTTIl t41 L Schroedinger Eigenmaps result in a 
more general class of matrices (operators), by allowing 
modifications to the diagonal as well as to the off-diagonal 
terms. These diagonal terms, in turn, are associated with 
locations of the barrier potential which affects and steers 
the diffusion process. On the other hand, Schroedinger 
Eigenmaps classification is a joint optimization process for 
labeled and unlabeled data, as opposed to classical semi- 
supervised algorithms Q, 

IV. Biomedical Applications 

The performance of Schroedinger Eigenmaps on three 
standard medical datasets is compared with linear and 
Gaussian kernel Support Vector Machines (SVM). After 
having verified the usefulness of Schroedinger Eigenmaps, 
we apply them to analyze new problems in genetic expres- 
sion analysis and retinal multispectral imaging. 

A. Three standard bio-medical datasets 

All 3 standard datasets are obtained from the UCI 
Machine Learning Repository. Training points are randomly 
selected. The potential is designed on the training data and 
SE are computed on the entire dataset so that VAC with 
threshold leads to the final class separation. 

After removing missing values, the Wisconsin Breast 
Cancer Database (WBCD) Bvl contains 683 patterns with 
9 attributes and is grouped into two classes, benign and 
malignant. The diagonal potential is used on the benign 
class in the training data. The processed vectors that fall 
below a threshold form the predicted benign class on the 
entire dataset. 

The Cleveland Heart Disease Database (CHDD) |[22ll 
contains 297 patterns with 13 attributes and is grouped 
into presence of heart disease (values 1,2,3,4) and absence 
(value 0). We removed 6 patterns due to missing values. 
Following the focus in the literature, we aim to separate 
absence from heart disease. We label absence in the training 
data using the diagonal potential and identify heart disease 
classes 1-4 by applying the nondiagonal potential. The 
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predicted absence class is formed by the vectors that fall 
below the threshold. 

The Mammographic Mass Data Set (MMD) fXh con- 
tains 961 instances and 5 attributes that are grouped into 
benign and malignant. Note that we removed 131 patterns 
due to missing values and suppressed the first attribute as 
it was a numerical assessment of the range between benign 
and malignant of a double-review process by physicians. 

In our computations of SE, we keep 6 eigenvectors, and 
the number of nearest neighbors varies between 6 and 20, 
the weight parameter a in the potential, and the threshold 
parameter are optimized for minimizing the error rates. 
We compare our method with Laplacian Eigenmaps (same 
parameters as for SE) and with SVM. The Matlab imple- 
mentation of SVM is used with a linear and a Gaussian 
kernel function, and the separating hyperplane is found 
from least squares or a one-norm with soft-margin svm; the 
best result is taken. The parameter in the SVM Gaussian 
kernel corresponds to a in Schroedinger Eigenmaps. After 
optimization of a in SVM for each dataset separately, it 
appeared consistent that best results for both, SVM and 
SE, were obtained by choosing a = 1/2,1,2 in SE and 
2cr in SVM for the datasets WBCD, CHDD, and MMD, 
respectively. 

Error rates for large training data yield information on 
the nonlinear data structure. For instance, as the error of 
the linear SVM applied to WBCD with 600 training points 
does not vanish, the data cannot be separated in a linear 
fashion, see Table ID 

The comparison to Laplacian Eigenmaps (same parame- 
ters as for Schroedinger Eigenmaps) shows how the poten- 
tial can improve the classification process. If there are only 
few training data, then Schroedinger Eigenmaps lead to the 
smallest error rates among all methods, most considerably 
for CHDD and MMD. When increasing the number of 
training points, the Gaussian kernel SVM yields the best 
results except for MMD. Typical bio-medical data allow 
for only few reliable training data and in this situation 
Schroedinger Eigenmaps appear most useful. 

B. Applications to new biomedical data 

Up to this point we have compared Schroedinger Eigen- 
maps with other methods for semi- supervised classification. 
We believe the strength of our proposed technique is three- 
fold: good performance with few labeled points, a global 
approach yielding good local discrimination, and ability to 
correctly classify in the presence of non-linear mixing. In 
this section we shall illustrate some of these points with two 
examples: a classification problem in medical imaging, and 
a clustering problem in genetic analysis. At the same time, 
these two examples give additional evidence for the results 
of the first part of our paper. 

1 ) Analysis of clinical, retinal images: First, we ana- 
lyze a problem in retinal imaging, where our goal is to 
detect and differentiate between two classes of chemical 
mixtures in the human retina. This problem is of interest 
to the medical community, as it appears in studies of age- 
related macular degeneration (AMD) - the leading cause 



dataset, #training data 


LE 


SVM lin. 


SVM Gauss. 


SE 


WBCD, 40 


14% 


5% 


5% 


4% 


WBCD, 100 


9% 


4% 


4% 


3% 


WBCD, 200 


- 


4% 


3% 


3% 


WBCD, 400 


- 


4% 


2% 


3% 


WBCD, 600 


- 


4% 


2% 


3% 


CHDD, 40 


42% 


21% 


19% 


15% 


CHDD, 100 


- 


17% 


15% 


12% 


CHDD, 200 


- 


16% 


11% 


12% 


CHDD, 297 


- 


15% 


9% 


11% 


MMD, 20 


45% 


24% 


24% 


22% 


MMD, 30 


40% 


22% 


22% 


21% 


MMD, 40 


38% 


22% 


21% 


20% 


MMD, 100 




21% 


20% 


20% 


MMD, 200 




20% 


19% 


20% 


MMD, 800 




20% 


18% 


20% 



TABLE I 

Error rates as incorrectly classified samples / total data 

are averaged over 100 instances for each method. le is 
combined with vac, where training data are used as seeds 
that always end up in the correct class. therefore, the le 
outcome for large training data is not due to the le 

METHOD 



of blindness among the elderly population in the developed 
world, see [[111, O, |[T9l, ESI. The balance between these 
two chemical mixtures, which we want to detect, can be 
used to determine the stage of the disease and, in turn, the 
proper course of action ||3T], [|32l , [|45l . There is demand 
in the medical community for automated analysis tools that 
enable detection and classification while, at the same time, 
allowing for expert input [l24l. [|34l. [|38l. 

We shall apply our Schroedinger Eigenmaps technique, 
with the locations of the potential chosen by medical 
experts. This example further illustrates that relatively few 
(less than 0.1%) labels may lead to good classification 
results. An added difficulty in this scenario is that, due to 
limitations of the imaging device, the spectral information 
about the chemicals of interest is always nonlinearly mixed 
with various other responses, e.g., from layers just beneath 
the retina, or from different illumination patterns. As such, 
most of the mixing models that rely on presence of pure 
pixels in the image fail to work. 

For each patient, 4 excitation with 2 emission filters and 
trifold imaging lead to 24 images (400 x 400 pixels) that 
are aligned by applying the commercial software i2k Align. 
One pixel, therefore, has 24 entries in z-direction, so that 
the acquired dataset are 160000 vectors in 24-dimensional 
space. The earliest clinical signs of AMD are bright spots 
in retinal reflection images, see Fig. I2a), referred to as 
drusen ^ . We have applied Schroedinger Eigenmaps with 
diagonal potential acting on a small predefined drusen area 
(50 pixels) that was identified by medical experts. The 
number of nearest neighbors was chosen to be 14, <j = 1, 
a in the potential is 10 times the average norm of the data 
vectors, and we used 15 eigenvectors. We obtain a drusen 
map of all pixel vectors whose Schroedinger Eigenmaps 
coordinates fall below a threshold that was adjusted through 
visual inspection, cf. Fig. EJb). 

It was suggested in 1211 . that drusen centers may separate 
from outer drusen areas. To distinguish the two drusen 
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(a) drusen appear as bright 
spots; one is marked by the 
green square; detection and fur- 
ther subclassification is needed 




(c) blue area (75 pixels) 
was additionally identified 
by SE (non-diag. matrix); 
VAC with threshold yields 2 
well-separated drusen classes 
(black/red). Consistent with 
(2TI, we separate drusen 
centers from outer drusen 
segments 



(e) The class identified in (b) is 
further subclassified by running 
linear SVM on this subset with 
labels on the drusen center 




(b) only a small drusen region 
(50 pixels) in the upper left cor- 
ner was penalized in the barrier 
potential providing one class of 
drusen 



(d) zoomed onto the marked 
drusen in (a); red lines show LRW 
segmentation; drusen center and 
outer drusen area were separated 
by using 3 sets of labels; drusen 
center (blue), drusen outer area 
(yellow), background (green) 



(f) Gaussian SVM with labeled 
data as in (e) applied to the 
class in (b) 



Fig. 2. Classification with Schroedinger Eigenmaps, LRW verification, and 
comparisons to SVM based on multi- spectral retinal image sets 



subclasses, we additionally form a second potential on a 
drusen center (75 pixels) that identifies pixels and run SE 
simultaneously with both potentials activated. We obtain 
two subclasses of drusen, one by pixels whose Schroedinger 
Eigenmaps coordinates fall below the threshold, the other 
one is derived from taking the average over identified pixel 
vectors as a seed in VAC. Threshold and class tightness 
parameter are optimized through visual inspection, see 
Fig. [2c). The same label locations are used for linear and 
Gaussian SVM that are applied to the detected drusen in 
Fig. Ob) for further subclassification, see Fig. I2e,f). 

In a single drusen, members of the same class are 
spatially connected, so that we can apply the segmentation 



tool Laplacian Random Walk (LRW) in (301, confirming 
the drusen subclassification, cf. Fig. [2d). SE appears well- 
suited to incorporate spatially unconnected regions, going 
beyond image segmentation. 

2 ) Analysis of gene expressions: Recent efforts in biol- 
ogy focus on the understanding of genetic datasets based 
on complex gene interactions, rather than analyzing genes 
individually. This shift of perspectives increases the impor- 
tance of small local variations, which may be irrelevant 
from a purely global perspective. Therefore, analysis and 
classification of time series gene expressions require the 
balance between local and global variations. 

Hierarchical clustering in [[T2l and Laplacian Eigenmaps 
in I25I was used to tentatively analyze a collection of 8316 
microarray gene expressions at 8 time-points related to eye 
morphology in mice, see lITOl . HSl for further background. 
The goal of those studies was to identify new gene clusters 
within this dataset that are associated to eye development. 

We used a barrier potential in [|26l to label certain DNA 
specific proteins (transcription factors) and found a set 
of new genes related to eye morphogenesis, which were 
not discovered by traditional hierachichal techniques. Thus, 
even a preliminary application of the barrier potential for 
data labeling proved useful in L26I to add complementary 
expert data enabling improvements in gene clustering. 

V. Conclusions 

We have introduced a generalization of Laplacian Eigen- 
maps to allow for expert data in form of a potential 
on a data-dependent graph. The properties of our novel 
Schroedinger Eigenmaps are studied, and we illustrate 
their action on both, artificial and standard medical data. 
At this stage these are only illustrations, and providing 
mathematical formalism behind these results is part of our 
on-going and future work on this subject. It appears that our 
method is robust and particularly useful if the data quality is 
low and there are only few training data, a typical situation 
in bio-medical applications. In addition, our scheme is 
successfully applied to the analysis of retinal multispectral 
images. We classify drusen based on few training pixels 
that were labeled by a physician. Our results suggest that 
Schroedinger Eigenmaps are useful in high-dimensional 
bio-medical data analysis when limited expert knowledge 
is available. 
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